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Summary — Based on expert opinions, informative prior elicitation for the common Weibull 
lifetime distribution usually presents some difficulties since it requires to elicit a two-dimensional 
joint prior. We consider here a reliability framework where the available expert information 
states directly in terms of prior predictive values (lifetimes) and not parameter values, which 
are less intuitive. The novelty of our procedure is to weigh the expert information by the size 
m of a virtual sample yielding a similar information, the prior being seen as a reference poste- 
rior. Thus, the prior calibration by the Bayesian analyst, who has to moderate the subjective 
information with respect to the data information, is made simple. A main result is the full 
tractability of the prior under mild conditions, despite the conjugation issues encountered 
with the Weibull distribution. Besides, m is a practical focus point for discussion between 
r^\ analysts and experts, and a helpful parameter for leading sensitivity studies and reducing 

£> the potential imbalance in posterior selection between Bayesian Weibull models, which can 

be due to favoring arbitrarily a prior. The calibration of m is discussed and a real example 
is treated along the paper. 
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1 Introduction 

H 

The versatile Weibull W(?7,/3) distribution, with density function 



fw(thP) = £(±\ expj-Q) 1 l >o } 

(77, /3) £ (0,oo) 2 , is one of the most popular distributions in reliability and risk assessment 
(RRA) and many other fields, mainly chosen for modelling the lifetime T of an industrial 
system or component £ [39] . In real-life studies, a Bayesian framework has often been 
highlighted when expert knowledge is available on E and observed lifetime data t n = t\, . . . , t n 
are small-sized and possibly contain missing or censored values [2] • Such contexts are usually 



encountered in industrial studies, especially when economical opportunities imply replacing a 
range of components at the same time, although they could have carried on running, and lead 
to "polluted" (e.g., censored) lifetime data. In those cases, all relevant sources of knowledge 
as expert opinion must be taken into account. 

Therefore, numerous authors [341 1351 |S] have focused their work on the elicitation of 
a joint prior measure 7r(ry, j3) that formalizes the expert knowledge, in order to integrate 
some decision-making function over the joint posterior distribution of these parameters, with 
density 



k(v,PK) = 



C(t n ; V ,P)TT(r,,/3) 



£(t n ;77,/3) 7r(?7,/3) drjd/3 
ml 



where C(t n ;r),f3) denotes the data likelihood. There are two main difficulties using the 
Weibull distribution. First, its only conjugate prior distribution is continuous-discrete [3TSJ 
and remains difficult to justify in real problems [20] . Second, the meanings of scale parameter 
7/ and shape parameter (3 greatly differ. Their values and correlation remain hard to assess 
by non-statistician experts, even though historical results |23j can be used to provide pref- 
erential values as a function of the behavior of the studied system. The methods proposed 
by the previous authors can suffer from this second defect and be applicable with difficulty. 
Therefore Kaminskiy & Krivtsov |20j recently provided a simple procedure to elicit a prior 
77(77, P) using expert knowledge about the mean and standard deviation of the cumulative 
distribution function (cdf) Fw They insisted on the fact that these values are easier to 
assess than parameter values. 

However, especially in sensitive areas like nuclear safety [42J . resorting to a Bayesian 
framework for reliability-based decision-helping implies defending the metholodology of prior 
elicitation in front of control authorities, the traditional difficulty being the treatment of its 
subjective aspects |16j . According to most wishes expressed by decisionners in our practice, 
the elicitation of a "defendable" prior tr(8) (of any model parametrized by 8 and not only 
Weibull) should respect the following items: (a) the quantity of subjective information can 
be directly compared, in terms of percentage, to the quantity of objective (data) information; 
and (b) the elicited prior must be unique. 

One could add to this list other wishes as the practical handling of the prior (explicit 
features and easy sampling), which is of importance for sensitivity studies. Note that the 
first item requires to give a clear sense to the words "quantity of information" . Statistical 
definitions like inverse Fisher matrices or Shannon entropies are often too technical to be 
directly accessible to decisionners. 

This article addresses those concerns. In the sequel, we consider an alternative elicitation 
of 7r(?7, /3) defined as the reference posterior of virtual data of size m but calibrated from 
lifetime magnitudes directly given by experts, pursuing the worry of realism expressed by 
Kaminskiy & Krivtsov 120] . Since the virtual size corresponds to an intuitive measure of 
prior uncertainty, ratios of virtual and observed data sizes bring an understandable sense 
to the notion of "relative quantity of information" . It can appear simpler than standard 
deviations (or other typical statistical uncertainty measures) to discuss with non-statistician 



experts and, especially, lead to more transparent choices to decisionners. 

The structure of the paper is as follows. The full prior elicitation is detailed in (the 
largest) Section [5] This methodological section focuses on the calibration of hyperparam- 
eters, the aggregation of independent expert opinions and the equitability issues between 
Bayesian Weibull models. Posterior computation is considered in Section [3j A numerical 
application on a real case-study is treated along the paper to illustrate the methodology. A 
Discussion section ends the paper, presenting alternative results and some avenues for future 
research. 



2 Prior elicitation 

2.1 Principle 

The central idea of prior elicitation comes from a simple vision of informative expert opin- 
ion already suggested by Lindley ^26 . We suggest to consider that a perfect expert opinion 
should be, roughly speaking, similar to a real data survey, and provide an independent and 
identically distributed (i.i.d.) sample t m = (ti,. . .,t m ) of lifetime data. Now, let tt j be 
a well- recognized formal representation of ignorance (namely, a noninformative prior), per- 
ceived as a reference benchmark measure on the parameter space. Assuming t m is known, 
the corresponding prior w should be the posterior distribution with density tt j \rj, /3|t m ). 



Posterior priors. Priors built as virtual posteriors present some advantages in subjective 
Bayesian analysis. First, they are unique since only defined by ir J and the likelihood (often 
historically chosen in experiments). Second, the correlation between parameters is automat- 
ically assessed through the Bayes rule. Third, as said before, the ratio between the numbers 
of virtual and real data helps to yield an understandable answer to the (often unclear) ques- 
tion "what is the ratio between subjective and objective information" asked by cautious 
decision-makers. Finally, the aggregation of independent expert opinions is simply carried 
out through successive Bayes rules, the consensus virtual sample being an aggregation of all 
virtual data. This avoids choosing opinion pooling rules which can suffer from paradoxes 

EDI- 

Maybe the most famous of such elicited priors is Zellner's g— prior [13] ■ Among others, 
Clarke [12] . Neal (25], Karny et al. [21], Lin et al. [35] and Morita et al. [55] examined various 
quantification of priors using virtual data. Kontkanen et al. [22] considered virtual data as 
practical tools for eliciting priors for Bayesian networks which may require automation in the 
treatment of their parameters. 

However, because of various factors, especially subjective ones, the sample t m is not di- 
rectly elicitable from an expert, and his or her information on lifetime T must be summarized 
through questioning processes [30] . This type of elicitation remains simple for distributions 
belonging to the natural exponential family, for which the resulting posterior priors are con- 
jugate (cf. 32J, § 5.3.3), because the virtual sample can be replaced by exhaustive statistics. 



In the continuous Weibull case, unfortunately, the only exhaustive statistic is the full virtual 
likelihood. Therefore the questioning must be oriented such that it allows the calibration of 
nonexhaustive virtual statistics. 

Expert questioning. In a concern of realism, following the ideas promoted by Kadane 
& Wolfson [TS] and especially Percy [3Tj in the field of RRA, we consider that an expert is 
mainly capable of providing observable information on T, unconditionally to (j],0). Indeed, 
the experts are usually not statisticians and should yield information independently from 
any parametrization choice (and even from any sampling model choice) made by a Bayesian 
analyst |21j . In other terms we assume that any realistic statistical summary of an expert 
opinion should be defined with respect to its associated prior predictive density, namely the 
prior density of plausible lifetime values 



/*(*) = // fw(t\fl,PMri,0)dvdp. (1) 

The density form of de Finetti's representation theorem [14] is then invoked to ensure the 
unicity of priors elicited in this way, under mild conditions of exchangeability for sequences 
of values ti, ■ ■ ■ ,t n , . . . See Press [32], § 10.5, for more precisions. Linking typical magnitudes 
of the observable variable T with statistical specifications can be made through decision- 
theoretical arguments. In the following, we consider experts who can answer to a question 
similar to the following one: 

Can you give estimates of relative costs {c\,c%) linked to two reliability-based decisions 
induced by the two mutually exclusive events T < t a and T > t a ? 

provided t a is given by the analyst (to improve the phenomenon anchoring of the expert 
and diminish subjective bias, cf. |41| ) and denoting a = c\j(c\ + c%) € [0,1]. Doing so, 
following the standard criterion of decision theory, namely the expected utility |33j . the analyst 
interprets t a as the minimizer of the predictive Bayes risk 



argmin / A(t ,t\c l7 c 2 )U(t) dt. 
t >oj 



defined by some loss function A(ig, tjci, C2) between the choice T = to and the unknown truth 
T = t, inflicting c\ to the event t < i (underestimation) and c 2 to the contrary event t > t 
(overestimation). The common choice 

A(to)*|ci,c 2 ) = \t- t 1 (ci • l{t<t } +c 2 • l{t>f }) ) 

which underlies the analyst wants to penalize similarly small and large misestimations |33j . 
leads to t a taking the sense of the a— order prior predictive percentile 

PAT <t a ) = ^ U(t) dt = a. (2) 

Jo 

Therefore an alternative equivalent query is, perhaps simpler, what is the risk a for S to 
break down before t a ? 



In the following, we assume finally that for each available expert, a unique specified cou- 
ple (t a ,a) among all elicitable can be considered as his or her most trustworthy specification 
(MTS) and must be exactly respected in the effective predictive prior modelling. Various 
reasons can be invoked for this. First, one cannot hope to elicit a prior ir(rj,f3) such that 
an arbitrary large number of specified couples (t a , a) be exactly respected together, because 
of the limited flexibility of parametric distributions (Berger 1985 |3], chap. 3). Second, a 
MTS often appears as a reality since experts have usually more difficulty to speak in terms 
of extreme values rather than values close to the median behavior [30] . Typically, they can 
share a similar MTS while their extremes can differ (see Example [I]). Other arguments can 
be related to decision-making: a cautious, conservative couple (t a ,a) can be favored by the 
analyst since the posterior analysis is focused on percentiles of higher, more critical orders. 

Example 1. TableUj already used in f$[, summarizes two prior opinions about the lifetime 
T (in months) of a device belonging to the secondary water circuit of French nuclear plants. 
According to a large consensus in the RRA field, T is assumed to be well described by a Weibull 
distribution. Giving a normative sense to extreme events (90% credibility), these experts were 
not questionned at the same level of precision. E\ is a nuclear operator and spoke about a 
particular component, in terms of replacement costs. Conversely, £2 is a component vendor 
whose opinion took into account a variety of operating conditions. Costs invoked here were 
mainly related to mass production. Therefore the two experts can be considered independent. 
Hence the common median appears as a robust specification and is chosen as the MTS for 
both. 

Credibility intervals (5%, 95%) Median value 



expert £1 [200,300] 250 

expert £ 2 [100,500] 250 

Table 1: Expert opinions about the lifetime T of a nuclear device (in months). 



2.2 A comfortable prior form 

Let us choose w J as the Jeffreys prior for Weibull. In more general Bayesian settings, Sun 
|37j proposed to favor the Berger-Bernardo reference prior [5] since it has slightly better 
properties of frequentist posterior coverage. But this prior requires at least to > 2 to get 
proper posteriors. This would be limiting in practice, when to is chosen small as it could be 
expected in cautious subjective assessments. Moreover, expert knowledge exerts here itself 
on T and not on any Weibull parametrization, therefore it seems relevant that a benchmark 
tt j be parametrization- invariant. See [12j for a straightforward defence of Jeffreys' prior in 
related problems, where a subjective posterior has to be compared in information-theoretic 
terms to an objective posterior. Thus we consider 

7r J (ry,/3) ex ?? _:L l{,,>o}l{/3>/3o} 

where /3o > is assumed to be fixed by objective reasons, like physical constraints. For 
instance, a reliability study focusing on industrial components submitted to aging leads to 



choose Po — 1 as explained in Bacha (1998) [2J, since involving a time-increasing failure rate. 
Without particular constraint, /3o = 0. Scale invariance imposes no other lower bound than 
for r). 

Denote QTQ(a,b,"/) the generalized inverse gamma distribution with density 

Reparametrizing x in /i — a;" 7 , [i ~ Q(a,b). Then our ideal prior is 7r(?y, /3) = 7r(^|/3)7r(/3), 
such that 

tjI/3 - gig(m,b(tm,P),P), (3) 

^ k ^(U) exp r ^o J *^°> (4) 

with 6(t m ,/3) = X)t=i^f' an d /^(tm) = w(^^ilogii) -1 . Both distributions are proper for 
all to > 0. The unknown virtual unsufficient statistics b(t m , (3) and /3(t m ) must be replaced 
in function of available expert information. The linkage between the prior form promoted in 
(3|4) and a MTS (t a ,a) elicitable for a given expert can be done as explained in the next 



proposition (proved in Appendix) and its corollary. 

Proposition 1. For a <G]0, 1[ and t a > 0, define the function b a : IN* x M* + by 

b a (m,(3) = ((I - a)- 1 '" 1 - I)'' tj . (5) 

Then b a {m, /3) is the only ft— continuous function such that, being substituted to 6(t m ,/3) in 
pi), Equation M) is verified almost surely. 

An immediate and pleasant consequence of replacing deterministic expression &(t m , j3) by 
b a (m,/3) is that n(/3) oc /3™ l_1 exp (— m(3\ogt a + m/3//3(t m )) 1{tj>o}- We recognize here the 
general term of a gamma distribution truncated in /3q . Finally the resulting prior is 

ri\p - ffZg(m,b a (m,p),0), (6) 

(7TL \ 
m ' ^7—, ^ZM (7) 

f3(m)J 

where /3(m) — (logt a — /3 _1 (t m )J . This result deserves some technical remarks. 
(i) The joint prior propriety imposes /3 _1 (t m ) < logt a , namely n"=i h < ^a- 



(ii) The joint prior ( 6|(7 ) can remain proper for all m extended on the half-line M* + . Thus 



fuzzy or doubtful experts can be graded using m < 1. This might be valuable if a group 
of P experts is considered as yielding less information than P i.i.d. data, for instance 
because they are suspected of mutual influence. 



(iii) The QXQ(a,b,j) distribution was firstly used by Berger & Sun jB], b being assessed 
independently of /3. However, this choice was made only because of the posterior 
conjugate properties conditionally to /3 (see § pj), and no meaning was given to the 
hyperparameters. Authors like Tsionas |39j adopted similar approaches. 

2.3 Prior calibration 

In addition to the MTS needed to define the prior form fnfh , supplementary prior information 
must be available for the calibration of (m, f3{m)). In the two following paragraphs we 
consider some cases commonly encountered in RRA. 

2.3.1 Calibrating 0(m) 

In RRA, it can occur that the analyst benefits from qualitative information on the nature of 
aging of S. For instance, assuming /3q — 0, if the expert can answer the question what is the 
probability < ot.fj e < 1 that £ is submitted to aging?, one would have a priori P(j3 < /3 e ) = 
1 — a / 3 e with j3 e = 1 and consequently 

p{m) = 2m/? e /xL(l " «/9.) (8) 

where Xm(l) ^ s the q— order percentile of the \m distribution. Other similar questions can 
be asked over accelerated aging (f3 e = 2) and extreme cases (/3 e = 5) reflecting inconceivable 
kinetics of aging in industrial applications [23] • Otherwise, databases of typical /? values 
(e.g., http://www.barringerl.com/wdbase.htm) can be used to quantify some alternative fea- 
tures of the gamma prior. 

However, most frequently (as in Example IlJ, other quantitative information is available 
under the form of a single or several credibility intervals, one of whose bounds is the pre- 
viously chosen MTS. We consider p > 1 supplementary (non-independent) specifications 
Qp = {t ai i&i}i£{i,...,p}, sorted by increasing order (<Xi < a»+i and t ai < t ai+1 ). Given m, 
calibrating /3(m) under those predictive constraints can be done by minimizing a distance 
2? m (/*,/„.) where /* is a pdf of T respecting exactly the MTS and the specifications listed 
in ftp. To avoid dealing with the infinite number of possible /*, we adopt the approach 
proposed by Cooke [13] : V is chosen as the discrete Kullback-Leibler loss function between 
required and elicited marginal features 

i=Q Vfx y 1 t [ l ai, l a i+1 \) 

El \ 1 ( a i+l _ a i) rn\ 

(a. l+1 -a l )\og T1 - ) -^ (9) 



- J e ) - n ^ .. - J e ) - 



where t ao = 0, t ap+1 = oo, a Q = a { e> = 0, a p+1 = a^ = 1, and for i G {l,...,p} 

4 e) = [[F w (t ai \ V ,/3)Tr(r)J)dr)dj3. 



7 



The convexity of this loss function in its argument 7r and, given m and /3, the one-to-one 
continuous correspondence between 7t(/3|/3(to)) and /3(m) allows for a unique solution of the 
calibration problem 

/3*(m) = arg min P m (/*,/»). 

7r(.|/3(m)) 

From nSh , estimating j3*(m) is similar to select ap c = 0.5 and minimizing (|9j) in the prior 
median /3* . This provides a direct view of the underlying aging and numerical estimations 
were found slightly more robust than those of the prior mean, or those of the best order a% 

if, conversely, j3 e is fixed. Therefore we temporarily note /?(m) = /3 m (/3 e ). For a given m, a 
combination of golden section search and successive parabolic interpolation [TU] can achieve 
a robust optimization of /3 e , provided the af are smoothly computed at each step of the 
algorithm. A smooth Monte Carlo estimation can be obtained using a unique importance 
sampling run /3i, . . . , /3a/ ~ G(m, m//3 e ,o), with large M, where /3 e ,o is a chosen starting point: 



a, (e) (m,&) c 


t 1- 


( Pe,0 \ 


m 1 M 

-Y 

M ^ 

3=1 


1 1 1 t( * 1 1 C"D 1 fl 


1 


1 


[l v 1+ 6 Q (m,/3,) y ) eXP ^ 


~K~o 


Note that 















Err(m) = © m (/*,4 



(■l/9*M) 

measures the expert incoherency with respect to the predictive Weibull distribution, given a 
virtual sample of size m in agreement with the expert opinion. If Err(m) remains large for 
many to, the Weibull choice for the virtual data (and therefore for any real dataset, provided 
the expert is relevant for the problem) is at least debatable, not to say probably inappropriate. 



2.3.2 Calibrating m 

The calibration of to must be adapted to the experimental context. A decisionner can impose 
a given virtual size to improve the clarity of the posterior result. For instance, Marin and 
Robert |27| proposed to give to the virtual size parameter of Zellner's g— prior (on regressors 
of a gaussian linear regression problem) the value to = 1 by default. In a similar context, 
another possibility, proposed by Celeux et al. [IT] and Liang et al. (2l| among others, is to 
establish an upper hierarchical level in the Bayesian model by considering to as a random 
variable for which a weakly informative prior must be elicited. A last possibility is to use to 
as a discussion tool between the analyst and the expert, since the meaning of to is under- 
standable outside the statistical field. Some heuristic methods in this sense are discussed in 
i- 

However our aim as a Bayesian analyst is mainly to measure the strenght of the expert 
opinion (assumed being correctly reflected through the prior modelling) through to. Besides, 
when the experts are no longer questionable and only a summary of their past opinions 



remains available, it seems somewhat difficult to elicit a hyperprior on this parameter. Then 
we suggest that m should be integrated as the minimizer of the expert incoherency risk, 
namely 

?7i* = arg min Err(m). 

m>0 

It is the analyst's decision to minimize this risk on IN* (not to loose the virtual size meaning) 
or ]R* + . In our experiments we chose IR* + to get the closest prior to the expert opinion. Ob- 
viously, one can avoid eliciting a too informative prior by limiting the minimization domain 
to (0,n]. 

Example 2. ('pursuing Example [h). For a continuum of values of m, we display on FiguresUj 
andVAthe optimized j3*{m) and the corresponding risk Err{m), respectively, for both experts. 
In both cases, the shape of 'to i— > Err(m) allows for a unique solution to*. We find to* = 3.36 
for expert £\ and to* — 2.50 for expert £2- This is logical since £\ is more informative than 
£2 ■ However, all values /3* (to) for expert E\ appear unrealistic in an industrial physical con- 
text, testifying from exponential uncontrolled aging, assuming the Weibull model is correct. 
Especially, the calibrated j3* (to*) = 16.5, which induces a peaked normal behavior of the prior 
predictive distribution (cf. fj<5j /). On the contrary, the opinion of expert £2 remains physically 
plausible (f3*(rn*) = 4.9) although the underlying aging is still strong. The corresponding cov- 
erage matching error |1 — (cq^ — ocf 2 )/90%|, where a t e is the effective percentile order for 
t,i G {100, 200} or ti G {300, 500}, is plotted in Figure\M and shows a good adequacy between 
the wanted and effective credible domains: less than 5% error in all cases, and less than 0.2% 
and 0.004% when choosing the calibrated to*, for experts £\ and £2 respectively. 



2.4 Aggregation of independent expert opinions 

In cases when the aggregation of i = 1, . . . ,p priors is chosen as a way to avoid interacting 
biases in a group of experts, it yields a similar information to that carried by a global virtual 
sample, which is the union of all experts' samples t mi . Because they are not explicitly known, 
one may use a concatenation of known samples s mi , . . . ,i m from another model M.{j],fi) 
such that their parametric likelihood (77, 0) H> £(s mi , . . . \rj, (3) leads to the same inference as 
the whole virtual sample. Indeed, we can show easily (cf. Lemma [I] in Appendix) that 



n J 



(j7,/3|t mi ,---,t mp ) = tt J m (?7,/3|i mi ,...,s mp ) ex 7r J (77,/3)J|^(s mi |?7,/3). 



i=l 



Next proposition, proved in Appendix, gives an example of such a likelihood (said virtual 
likelihood) for a single expert opinion. 

PROPOSITION 2. Consider s m = (k a , m ,/3t a ,m), where k a ^ m = ((1 - a) -1 /™ — 1) _1 and 
Pt a ,m = /3(m)/(l + j3(m) log t a ), as a sample whose components follow independently the 
Q(m,m(t a /ri)P) and IQ(m,mf3) distributions, respectively. Then it defines a virtual likeli- 
hood for an expert opinion summarized by (t a ,a, /3(m)). 




Expert E1 

- - Expert E2 



10 15 20 

virtual size m 



virtual size m 



Figure 1: Calibrated prior mean f3*(m) in 
function of m, for each expert opinion sum- 
marized in Tabic [T] 



Figure 2: Expert incoherency risk Err(m) 
in function of m, for each expert opinion 
summarized in Table [l] Minima of plots 
indicate calibrated values m* . 



Expert E1 
Expert E2 








5 10 15 20 25 30 

virtual size m 



200 300 

lifetime T 



Figure 3: Prior predictive coverage match- 
ing error in percentage. 



Figure 4: Prior predictive densities for 
both expert opinions and the aggregation 
of opinions. 
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After simple algebra, the resulting prior for all expert opinions is of the same form (|6|7j) , 
for which (respecting intuition) m = X^=i m i' b a (jn, P) — X^=i b ai ( rn ii P) an d 

B(m) — m > -^ — * — 

EXAMPLE 3. ('pursuing Example El). Denote (tti,^) the priors calibrated in Example \E for 
each expert. Denote tt^ the aggregating prior. Although 7r 3 appears less relevant than 7T2 
with respect to a Weibull model in a RRA context, we have no supplementary information to 
weight the strenght of its corresponding virtual sample in ir^ . Then, by defect, ir^ is defined 
by 

m = 5.86, b a ((3) = 7.33 • (250)' 3 , J3 = 8.30. 

Corresponding prior predictive densities are plotted on FigureUl As it could be expected, the 
aggregation prior realizes a trade-off between the two priors in the sense it favors the common 
median according to an intermediate peak, due to the addition of spread virtual data (expert 
£2) to concentrated virtual data (expert £\). 

2.5 Prior equitability among Weibull models 

Weibull models are often used as bricks for more general reliability models, like competing 
risk models [7j or mixtures 0D]. Especially, a usual challenge in RRA to choose between 
exponential and Weibull models. Since the exponential is nested into the Weibull model 
(/3 = 1), a simple likelihood ratio test can be carried out in the frequentist framework. A 
Bayes factor is also easy to compute in our framework. Logically, both models share the 
same prior elicitation method, with the same MTS. 

Then denote (m£,mvi/) the two corresponding virtual sizes, (tte^w) the associated 
priors, and assume the more complex prior ttw has been calibrated. How should the be 
calibrated such that none of the prior Bayesian models is arbitrarily favored in absence of 
real data? The problem of defining such a prior equitability to reduce bias in posterior 
selection has been considered by many authors (see |llj for a review), who proposed several 
rules. Celeux et al. (2006) [IT] gave decisive arguments to calibrate tte such that it minimizes 
the Kullback-Leibler divergence between predictive distributions 

m% = arg mB minKL(/ 7rvv ,,/ 7rB (.|mB)) 
where, after simple algebra, 

b™ E {m E ,l) 



U E {t\m E ) = m E 



(b a (m E ,l)+t) mE+1 ' 



A unique Monte Carlo sampling of f^ w can be used to get a smooth description of the KL 
divergence and its derivative in m, so that a coupled Newton-Raphson method can provide 
a good estimate of m* E . This strategy can be carried out on more complex Weibull models, 
sorting them through their decreasing order of degree of freedom. If all authors have empha- 
sized the difficulty of this task when nested models are nonconjugate with multidimensional 
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exponential virtual size 

Figure 5: KL distance between prior pre- 
dictive Weibull and exponential distribu- 
tions dedicated to expert SzS opinion, in 
function of the exponential virtual size the- 
The minimum is reached in m%. 



2 3 

Weibull virtual size 



Figure 6: Correspondence between Weibull 
and exponential virtual sizes (mw, m* E ) for 
the expert £Vs opinion. 



parameters, our framework leads to a rather simple optimization. 

EXAMPLE 4. ('pursuing Example^. For various values of tue, the KL divergence is plotted 
for expert £ 2 on Figure [5l The KL convexity allows for a unique solution m* E = 6.70. 
When modifying Tnw> the correspondence between mw an d m *E * s plotted in Figured A 
similar calculus for expert E\ leads to a very high value of rn E (upper than 200), because any 
exponential predictive distribution cannot approximate well a peaked normal distribution. An 
exponential assumption thus appears deeply irrelevant for this expert opinion. 

It could have been expected that m* E < m^ since the more complex Weibull model should 
need more data that the exponential one to describe the same prior information. However, the 
model simplification reduces the global uncertainty in the effective prior predictive distribution. 
This has a direct impact on the virtual size m E which varies inversely to uncertainty measures. 

3 Posterior inference 

Thanks to the considerable development of numerical sampling methods, posterior computa- 
tion is no longer burdensome in two-dimensional cases. Nonetheless, the conditional conjuga- 
tion prior properties simplify the work of the Bayesian analyst. To be general in the RRA area 
and in relation with ExamplepjJ we assume that observed data t n = (t±, . . . , t n ) contain r i.i.d 
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E- =1 lo; 

such that 



uncensored data ij" , . . . ,tf and n — r right-censored data. Denote St n {P) = Ym=i tf an d 

r 



/3t n = r /X^=il°g*i ■ Then the joint posterior distribution has density 7r(?7|/3, t n )-7r(/3|t n ), 



77|/3, t n - gig(m + r,6(m,/3) + ,5t„(/3),/3), 
,„, , &™(ro,p , )p ,m+r '- 1 f V m r \\ , 

(b a {m,(3) +<)t„(p)) I \p(m) Pt„/J 

It is enough to obtain approximate posterior sampling of /3 to get a complete joint sampling 
(using Gibbs sampling for 77 conditional to /3). This can be made efficiently via the adap- 
tive rejection sampling algorithm from Gilks and Wild [18j . In a noninformative context 

(i.e., when 771 > + ) which can be easily adapted to a more general setting, Tsionas [35] 

proposed a gamma instrumental distribution p((3) whose mean is calculated to optimize the 
acceptance rate. 

A particular attention must be paid to the existence of posterior moments, especially the 
first one which defines the Mean Time To Failure (MTTF) in a RRA context. The conditional 
posterior mean of 77 is 



Efolfttn] = A(r,m,0)(m-r£ ifi +r-fify 



1//3 



with 77 e = k a ,mto,/m, 77 = r _1 Y^i=i h the conditional MLE, and A(r, m, (3) — F(r + m — 1/ f3)/T{r + to), 
so that the MTTF is not defined if /3o < l/(m+r). More generally, using an important result 
of Sun and Speckman [5S] (proof of Theorem 5), one can prove that the fc th moment of the 
posterior predictive density 



E[T fe |t n ] = ff 77 fe r(l + fc//3)7T(7 7 ,/?|t n )d7 7 d/3, 

JJm? 
ex / r(l + k/0)T{r + m - k/p)n(J3\t n ) d/3, 



J1R+ 

exists only if (3q > k/(r + to) for any k > 0. This result is especially useful in the sense it 
gives to the analyst a necessary requirement on the prior precision to justify the practical 
handling of the posterior predictive distribution through usual statistical summaries, in re- 
gards of the information available from really observed data. In mild conditions (r > 5 and 
fc < 2), choosing a defect /3q — k/r appears as a practical calculus artifice, and the more 
justified choice /3o = 1 in aging studies is sufficient to ensure in practice the existence of 
posterior predictive moments. 

EXAMPLE 5. ('pursuing Example Up. We consider the right- censored lifetime data t n (n = 
18, r = 1Q) from Table [M They correspond to failure or stopping times collected on some 
similar devices ^ close to the one considered in Example \n The maximum likelihood esti- 
mator (MLE) is (f]n,fin) — (140.8,4.51) with estimated standard deviations a n = (7.3,1.8). 
This strong aging is in agreement with the opinion of expert £2 ■ 

Choosing /3q = 0.1 does not modify significantly the calibration of priors for both experts, 
and the resulting posterior distributions of the MTTF are plotted on Figure [?] The peak 
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Figure 7: Posterior distributions of the 
MTTF for both experts and their aggrega- 
tion, and in a noninformative framework. 



Figure 8: Posterior predictive densities for 
both experts and their aggregation, and in 
a noninformative framework. 



observed when modelling a noninformative expert is due to the concentration of data far from 
time regions favored by the priors, and logically the aggregating prior, as the most informative, 
shifts the MTTF to the highest values. 

The calibrated opinions of experts £\ and E-i have a relative weight of respectively 34 % and 
25% of the real data information transmitted to the posterior distribution, but, as it could be 
expected, their optimism in terms of lifetime has a strong influence on this important func- 
tion of interest. Note however that due to variance increasing, the left tails of the posterior 
predictive pdf (Figure pD are upper than the tails of a noninformative posterior predictive pdf. 
This means that, given a small t 7 (for instance a replacement time), the posterior estima- 
tion of '7 = P(T < t-y) will be slightly overestimated - if we add the experts' opinions - with 
respect to this given by an only data-driven prediction. On the contrary, when i 7 is high, the 
noninformative posterior can appear somewhat too conservative. 



real failure times 134.9, 152.1, 133.7, 114.8, 110.0, 129.0, 78.7, 72.8, 132.2, 91.8 

right-censored times 70.0, 159.5, 98.5, 167.2, 66.8, 95.3, 80.9, 83.2 

Table 2: Lifetimes (months) of nuclear components from secondary water circuits. 
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4 Discussion 

The elicitation of a multidimensional prior, perceived as a reference posterior conditional to 
virtual data supposed to reflect a perfect expert opinion, is a practical way of assessing indi- 
rectly the correlations in the parameter space, coherently with the sampling model. Another 
important gain is the possibility of assessing the prior uncertainty in an understandable way 
by modulating the virtual size, for instance for sensitivity studies. This indicator of prior 
information might help to increase the trust of a decision-maker in the posterior beliefs and 
the acceptation of Bayesian assessments by control authorities. Unfortunately, such priors 
are often untractable since they require to assess nonexhaustive statistics of the virtual data. 
This is especially the case with the Weibull models. 

In this article, however, we showed how this issue can be overcome, replacing those un- 
tractable statistics with functionals such that the resulting prior answers to the statistical 
specifications of the prior knowledge, under the form of percentiles. Note that other alter- 
natives to percentiles could have been considered: for instance, following Percy |31j . assume 
an expert can provide an estimate t e of the marginal MTTF or the mode Md[T], namely 

MTTF = E[T] = // E[T\ V ,j3]f w (t\r)JMr),j3)d V dl3, 



Md[T] = arg t>0 max// f w {t\ V , P)^, (3) dr,d(3. 

J JlR 2 + 

The first estimate t e is thus related to a quadratic loss function A(tojt|ci,C2) which assumes 
the equality of costs c\ — c^ and a penalisation increasing with |£q ~Ai while the second can 
be explained by the limit of a series of binary loss functions [33]. Following the principle 
described in Proposition nl one should replace b(t m ,j3) in (13| by, respectively (see [5] for 
details), 



e ' 



***> ■ ( r(i + i/wLi/fl ) * 

Km, ft) = ("P^tl 



(5-1 

However, 7r(/?) is no longer (but remains close to) a gamma density, and furthermore these 
two specifications require conditions over m and the domain of variation of 7r(/3) to be usable. 
Indeed, one must guarantee f3 Q > 1 to ensure b(m, /3) is well defined when t e is specified as 
a unique mode. This is coherent with the Weibull features, since the Weibull distribution 
has a unique positive mode if and only if (3 > 1. As explained before, assuming aging is an 
equivalent prior constraint placed on the model. 

Since assuming a prior predictive percentile can be provided by an oriented questioning, 
and fortunately leads to an explicit and versatile joint prior on Weibull parameters, we sug- 
gest Bayesian reliability analysts should favor, as much as possible, this kind of elicitation. 
This agrees with the vision historically promoted by Berger [3] (chap. 3) and Percy [3Tj . 
who considered that quantile-based approaches pose among best elicitation methods, the 
estimation of probabilities of localization in given areas being simpler for experts than the 
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assessment of statistical moments. 

Along the paper some remaining issues and limitations of the prior modelling have been 
evoked, which are now discussed as potential avenues for future researches. These researches, 
besides, will be dedicated to extend this methodology to other models which are often used 
in reliability studies, especially extreme value models whose links with Weibull distributions 
are well known. 

It appears firstly that checking for the appropriateness of the Weibull model with respect 
to the virtual data is a crucial task, since it allows to reveal divergences between an expert 
opinion and the common sense of reliability practicioners when assuming the Weibull dis- 
tribution for the lifetime of an industrial component submitted to aging. As the case for 
expert E\ illustrates, providing a small prior credibility interval can underlie unrealistic val- 
ues for the Weibull parameters. This agrees with the well-known behavior of RRA experts 
of underestimating their self- uncertainty |23j . Therefore we suggest that spreading the prior 
credibility orders such that a qualitative requirement is reached (e.g., P-n-QS < 2) = 0.5) can 
give a more reasonnable summary of the real knowledge of E. 

Another issue deals with the remaining uncertainty in expert opinion. In this article, we 
proposed a simple definition of the ratio of subjective and objective information based on 
virtual and real data sizes, understandable outside the community of statistics. It is clear 
however that information quantities updated through Bayesian inference are also strongly 
dependent on the possible conflicting issues between prior and real data, in the sense that 
both can favor regions of the sample and parameter spaces which are far from each other 
[17| |9] . Tools proposed in these two references should be carried out to check the internal 
coherency of the Bayesian model beforehand. 

Besides, we did not consider here the remaining difficulties occuring when the expert can 
be suspected of bias, for instance because of motivational reasons [3] or dependency within 
a group. Many other tools, based on test experiments, have been proposed in the literature 
in order to quantify those biases (e.g. [3H E51 US])- However, in absence of supplementary 
information, we prefered respecting the summarized expert opinions the best we could in our 
case-study 

Thus, the case of dependent experts has not been treated in this paper, since it remains 
controversial in Bayesian statistics [3U] and probably deserves specific studies in the RRA 
area. Pursuing our view, two experts are dependent if they share virtual data from their past 
experience, or if a part of a virtual sample is produced dependently from the other sample. 
Dependency could then be introduced through a hierarchical mechanism of data production, 
in the spirit of the supra- Bayesian approaches promoted by Lindley |26j . This theme will 
also be considered in our next works. 

In future studies, it will be necessary too to provide some calibration tools to take into 
account the expert uncertainty and bias. A first avenue can simply be to add a hierarchical 
level conditional to m. Indeed, we assumed here that the expert subjectivity mainly lies in 
the self-estimation of the costs associated with a reliability decision, and thus (using notations 
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from § 2.3) in the estimation of sorted orders a\ < . . . < a p . Let us denote a\ < . . . < a p 
these prior estimates. If we pursue the virtual size idea, it appears logical to consider the on 
as correlated random variables such that, a priori, 

ai,a 2 - ai,.. .,a p - a p -i,l- a p ~ V ir {v\, . . . , v p+ i) 

with ft — 1 being the number of virtual "past" observations of the event t ail < T < t ail 
which induces Xa=i v % = tn + p + 1, the Dirichlet distribution appearing naturally from 
well-known conjugation properties. Imposing E[a^ — «i-i] = on — cti-i leads to elicit 

Vi = (m + p + 1) (dti-ai-i). 

The obvious correlation between prior estimates a, threatens to underestimate the prior 
uncertainty of the o^, so that it appears more appropriate to help the expert providing 
conditional probabilities by answering to the following question: what is the risk ai — on-\ 
for Yi to break down before t ai knowing £ still runs after t ail ? Thus, the randomization 
of values («i, . . . , a p ) imposes indirectly a prior distribution 7r(/3*(m)) rather than a single 
value, and consequently, m should now be calibrated as the minimizer of the expected expert 
incoherency risk 



arg min E ,„,, -,-, T> m If*, f / ,?,, .\ 
m>o n/ 3 ("*)) L V V {-\P ( m )J 



But this calibration remains in facts difficult to carry out, since the untractability of 7r(/3*(m)) 
imposes a double Monte Carlo approximation coupled to an optimization strategy. Our next 
research will focus on simplifying this computational work. 

Finally, we remind to the reader that other elicitation approaches of percentile orders are 
possible, mainly based on the establishement of expert preferences on a series of bettings 
such that a is not directly estimated but is progressively bounded [30 . Such methods are 
robust with respect to perturbations of the often criticized expected utility criterion, since 
they can lead to results that are independent of the behavior of the expert face to his or 
her self-perception of the risk pQ . It thus should be worthy to adapt our modelling and the 
calibration aspects to this type of elicitation, which could lead to more cautious, credible 
statistical features of the prior modelling. 
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Appendix 



Proof of Proposition 

Then, renaming b(t m , 0) in b a {m,0) in (3|4) 



/3 m_1 / \ ( t& 

Denote f m (0) = —— — m exp m— »— and g m (B) = 1 + ° 

(b a (m,0)) \ 0{t m )J \ b a (m,P) 



PAT<t a ) = // F w (t a \r],0)7r(T)\p)7r(0)dTid8, 

J JM + xM + 

= 1-A m f f m (l3)g m {8) dp 

where A" 1 = j' f m (8) d/3 which must be necessarily finite to get a proper ir(B). Assum- 
ing P-k{T < t a ) = a leads to J M+ f m (0)h m {/3) d/3 = where h m (0) = g m {0) - (1 - a) 
which is continuous in > if i— > b a (m,0) is assumed continuous for any m > 0. Since 
H ^ fm(0) > except possibly on a finite number of points in M+, h m (0) = almost 
everywhere in IR+. Expression ([5]) follows immediately. 

Lemma 1. Denote Mi and Ai 2 t wo sampling models with same parameter 9. Denote 
t m . a non-observed Mi — sample with likelihood i M (\). Let s m . = g(t mi ) be an observed 
M 2 — sample with likelihood i M (i), such that 9 1— > £j^m(6) oc £ M (i)(9). Denote ir J a prior 

measure on 9 and 7r£. (i) its posterior knowing likelihood £ M u). Then for p various samples 
t mi ,--,t m , one has 



n MW (^l*mij---)t mp ) = ^(2) (0\ 



mi ) • • • j °m p 
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Proof. The proof is straightforward, and we can consider only two non-observed samples 
t mi and t m2 (corresponding possibly to two independent expert opinions). Then 

71-^(1) (<?|t mi ,t m2 ) OC £ M (1) (trnJ^TT^d, (6»|t m2 ) , 

oc £ M (2) (s mi \0) n J (9)l M (i } (t m2 \9) , 
oc £ M (2) {s mi \9)£ M (2) {s m2 \9)ir J (6), 
oc £ M (2) (s mi Ifl)?!-^) (0|s m2 ), 

OC n M(' 2 'i ("l S mn s m 2 ) 

and the equality follows by immediate normalization. 

Proof _of Proposition [2J The Weibull likelihood of the to virtual data summarized b 
(t a ,a, f3(m)) is proportional to £(f],f3) — ir(r), {3)/tt j (r], (3) where 7r(?y,/3) is defined by 
Modify the parametrization using /i = rf~ ' . Thus 



ea by 
ffi. 



£(fi,/3) oc n m exp(-[ib a (m,f3)) f3 m exp -m 

OC |M(*a) } exP {-Mfea.m (ta) J 
OC ^l(A a ,m|A».j8) 4»(ft a ,Jj8) 







/3(m) 



/3 m exp < — m 



/3(m) 



exp{— m/3 logi Q } 



where 



4(fca,m|A«,i3) oc 



r(m) 



'^a,m 6Xp s K a ,m (t-a) f 



which is the likelihood arising from considering fc Q . m ~ Q(m,m^(t a )^), and 

/> /tf |ft\ ( m /?) m flm-l J l+/3(TO)l0gi a } 

MPtc«,m|p) oc & exp< -TO/3- 



r(m) ' *"""• * [ /3(m) 

which is the likelihood arising from considering f3 ta .s ~ XQ(m,mf3). 
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